u3: optimizes u3i_edit (nock opcode 10) #362

joemfb · 2023-05-01T18:55:49Z

This PR ports urbit/urbit#6004. It is not urgent.

Benchmarking this against the old implementation would be worthwhile. It might also be worth inlining/decomposing u3a_is_mutable(); this is (currently) its only call-site, and the road pointer and seniority predicate cannot change throughout this loop.

ashelkovnykov · 2023-11-15T23:31:42Z

This PR came up today during a sync between @joemfb and I. I skimmed it to make sure I understood the motivation and how it spiritually aligns w.r.t. similar noun walks in Ares, but I didn't verify the logic carefully enough to really "review" it.

This reverts commit a6debdb.

joemfb · 2023-11-16T21:28:03Z

I've written a benchmark for this code, and run the various versions of this operation through it. The benchmark constructs a list of 1s, and then edits the list at the axis of each item, replacing the 1 with 2. This is an absurd operation, and no real-world code we ever run would look like it. But it's the best-case scenario for certain optimizations here, and gives us an opportunity to compare them:

opcode 10 variation	1k list	10k list
develop (`a244cc5`)	19ms	2.41s
^ without mutation (`a6debdb`)	26ms	3.218s
loop (`d6988e6`)	1ms	147ms
reduce branches (`fa84fd8`)	1ms	170ms
inline axis bit-math (`e182de8`)	1ms	159ms
^ without mutation	6ms	667ms
inline axis without branch reduction	1ms	130ms

Notably, the mutation optimization (we can edit a cell in place if it is on our road and has a refcount of 1) is only a ~30% improvement with the old implementation, but is a 5-6x improvement in the new.

I suspect that the branch reduction being slower is an artifact of this benchmark -- we're almost always editing the tail of a cell (lists associate to the right), making branch prediction very effective. So I've left it in for now, but I'm open to being talked out of that.

ashelkovnykov

Re-reviewed later, and then just now w/ Joe

joemfb requested a review from a team as a code owner May 1, 2023 18:55

jalehman assigned barter-simsum Jun 26, 2023

joemfb added 6 commits November 16, 2023 15:49

vere: adds benchmark for opcode 10

a244cc5

TMP remove mutation optimization from u3i_edit()

a6debdb

Revert "TMP remove mutation optimization from u3i_edit()"

acd4a94

This reverts commit a6debdb.

u3: rewrites u3i_edit into a loop (tail recursion modulo cons)

d6988e6

u3: further optimizes u3i_edit, removing head/tail branches

fa84fd8

u3: further optimizes u3i_edit, inlining axis bit math

e182de8

joemfb force-pushed the jb/edit-faster branch from bec84ac to e182de8 Compare November 16, 2023 21:27

joemfb requested a review from eamsden November 16, 2023 21:28

ashelkovnykov approved these changes Nov 29, 2023

View reviewed changes

joemfb merged commit 65f03b3 into develop Nov 29, 2023
5 checks passed

joemfb deleted the jb/edit-faster branch November 29, 2023 18:31

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

u3: optimizes u3i_edit (nock opcode 10) #362

u3: optimizes u3i_edit (nock opcode 10) #362

joemfb commented May 1, 2023

ashelkovnykov commented Nov 15, 2023

joemfb commented Nov 16, 2023

ashelkovnykov left a comment

u3: optimizes u3i_edit (nock opcode 10) #362

u3: optimizes u3i_edit (nock opcode 10) #362

Conversation

joemfb commented May 1, 2023

ashelkovnykov commented Nov 15, 2023

joemfb commented Nov 16, 2023

ashelkovnykov left a comment

Choose a reason for hiding this comment